Picture for Xilin Chen

Xilin Chen

Contrastive Spectral Rectification: Test-Time Defense towards Zero-shot Adversarial Robustness of CLIP

Add code
Jan 27, 2026
Viaarxiv icon

T2VAttack: Adversarial Attack on Text-to-Video Diffusion Models

Add code
Dec 30, 2025
Viaarxiv icon

Towards Transferable Defense Against Malicious Image Edits

Add code
Dec 16, 2025
Viaarxiv icon

Dual Attention Guided Defense Against Malicious Edits

Add code
Dec 16, 2025
Viaarxiv icon

Semantic Mismatch and Perceptual Degradation: A New Perspective on Image Editing Immunity

Add code
Dec 16, 2025
Figure 1 for Semantic Mismatch and Perceptual Degradation: A New Perspective on Image Editing Immunity
Figure 2 for Semantic Mismatch and Perceptual Degradation: A New Perspective on Image Editing Immunity
Figure 3 for Semantic Mismatch and Perceptual Degradation: A New Perspective on Image Editing Immunity
Figure 4 for Semantic Mismatch and Perceptual Degradation: A New Perspective on Image Editing Immunity
Viaarxiv icon

VisKnow: Constructing Visual Knowledge Base for Object Understanding

Add code
Dec 09, 2025
Figure 1 for VisKnow: Constructing Visual Knowledge Base for Object Understanding
Figure 2 for VisKnow: Constructing Visual Knowledge Base for Object Understanding
Figure 3 for VisKnow: Constructing Visual Knowledge Base for Object Understanding
Figure 4 for VisKnow: Constructing Visual Knowledge Base for Object Understanding
Viaarxiv icon

VOPE: Revisiting Hallucination of Vision-Language Models in Voluntary Imagination Task

Add code
Nov 17, 2025
Viaarxiv icon

GLip: A Global-Local Integrated Progressive Framework for Robust Visual Speech Recognition

Add code
Sep 19, 2025
Viaarxiv icon

MoTE: Mixture of Ternary Experts for Memory-efficient Large Multimodal Models

Add code
Jun 17, 2025
Viaarxiv icon

BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation

Add code
Jun 09, 2025
Viaarxiv icon